| Non-Rationalised Economics NCERT Notes, Solutions and Extra Q & A (Class 9th to 12th) | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9th | 10th | 11th | 12th | ||||||||||||||||
Chapter 5 Measures Of Central Tendency
Introduction to Measures of Central Tendency
Introduction
After organizing and presenting data in tables and graphs, the next step is often to summarize the data using a single representative value. This is where measures of central tendency come in. They are a numerical method to explain a large set of data in brief. We use them frequently in day-to-day life, for example, when we talk about average marks, average rainfall, or average income.
A measure of central tendency summarizes the data in a single value in such a way that this value can represent the entire dataset. It provides a typical or representative value for the distribution.
Example 1. Baiju is a small farmer in Balapur village with 1 acre of land. To understand his economic condition relative to the 50 other small farmers in the village, we need to summarize the data on the land holdings of all farmers. We could find:
- The average size of land holding in the ordinary sense (Arithmetic Mean).
- The land size above which half the farmers fall (Median).
- The land size that most of the farmers own (Mode).
These measures help us evaluate Baiju's relative economic condition by comparing his land holding to a single representative value for the entire village.
Commonly Used Measures of Central Tendency
There are several statistical measures of central tendency, or "averages". The three most commonly used are:
- Arithmetic Mean
- Median
- Mode
Other types of averages, like Geometric Mean and Harmonic Mean, are suitable for specific situations, but this discussion will focus on the three main types.
Arithmetic Mean
The Arithmetic Mean is the most commonly used measure of central tendency. It is defined as the sum of the values of all observations divided by the number of observations. It is usually denoted by $\bar{X}$.
Formula for Ungrouped Data
If there are N observations $X_1, X_2, X_3, \dots, X_N$, then the Arithmetic Mean is given by:
$\bar{X} = \frac{X_1 + X_2 + X_3 + \dots + X_N}{N} = \frac{\sum X}{N}$
Where $\sum X$ is the sum of all observations and $N$ is the total number of observations.
Calculation Methods for Ungrouped Data
1. Direct Method
This method involves summing all the observations and dividing by the number of observations.
Example 1. Calculate the arithmetic mean of the marks: 40, 50, 55, 78, 58.
$\bar{X} = \frac{40 + 50 + 55 + 78 + 58}{5} = \frac{281}{5} = 56.2$
2. Assumed Mean Method
When the number of observations is large or the figures are large, this method simplifies calculations. An "assumed mean" (A) is chosen, and the mean is calculated based on the deviations (d) from this assumed mean.
$\bar{X} = A + \frac{\sum d}{N}$, where $d = X - A$.
3. Step Deviation Method
This method further simplifies calculations by dividing the deviations by a common factor 'c'.
$\bar{X} = A + \frac{\sum d'}{N} \times c$, where $d' = \frac{X - A}{c}$.
Calculation Methods for Grouped Data
For grouped data (both discrete and continuous series), the methods are similar, but frequencies (f) are taken into account.
Direct Method (Discrete/Continuous Series)
For a continuous series, the mid-point (m) of each class interval is used as X.
$\bar{X} = \frac{\sum fX}{\sum f}$ or $\bar{X} = \frac{\sum fm}{\sum f}$
Assumed Mean and Step Deviation Methods (Discrete/Continuous Series)
The formulas are adapted to include frequencies:
Assumed Mean Method: $\bar{X} = A + \frac{\sum fd}{\sum f}$
Step Deviation Method: $\bar{X} = A + \frac{\sum fd'}{\sum f} \times c$
Properties of Arithmetic Mean
- The sum of deviations of items about the arithmetic mean is always equal to zero. Symbolically, $\sum (X - \bar{X}) = 0$.
- The arithmetic mean is affected by extreme values. A very large or very small value can significantly push the mean up or down.
Weighted Arithmetic Mean
Sometimes, it is necessary to assign different levels of importance, or 'weights' (W), to different items when calculating the mean. The weighted arithmetic mean is given by:
$\bar{X}_w = \frac{\sum WX}{\sum W}$
This is useful when calculating average prices, where the quantities consumed act as weights.
Median and Quartiles
Median
The Median is the positional value of a variable that divides the distribution into two equal parts. It is the "middle" element when the data set is arranged in ascending or descending order. One part of the distribution comprises all values greater than or equal to the median, and the other part comprises all values less than or equal to it.
A key feature of the median is that it is not sensitive to extreme values. For example, in the series 1, 2, 3000, the median is 2, whereas the mean is 1001. The median provides a better measure of central tendency when the data contains outliers.
Computation of Median
Ungrouped Data
- Arrange the data in ascending or descending order.
- Find the position of the median using the formula: Position = $(\frac{N+1}{2})^{th}$ item.
- If N is odd, the median is the middle value.
- If N is even, the median is the arithmetic mean of the two middle values.
Grouped Data (Discrete Series)
- Arrange the data and calculate the cumulative frequency (c.f.).
- Find the position of the median using the formula: Position = $(\frac{N+1}{2})^{th}$ item.
- Locate this position in the cumulative frequency column. The corresponding value of the variable is the median.
Grouped Data (Continuous Series)
- Find the median class where the $(\frac{N}{2})^{th}$ item lies.
- Apply the following formula to find the median value:
Median = $L + \frac{(\frac{N}{2} - c.f.)}{f} \times h$
Where:
- $L$ = lower limit of the median class.
- $c.f.$ = cumulative frequency of the class preceding the median class.
- $f$ = frequency of the median class.
- $h$ = magnitude (width) of the median class interval.
Quartiles and Percentiles
Quartiles
Quartiles are measures that divide the data into four equal parts. There are three quartiles:
- First Quartile ($Q_1$) or Lower Quartile: Has 25% of the items below it and 75% above it.
- Second Quartile ($Q_2$): This is the Median. It has 50% of items below it and 50% above it.
- Third Quartile ($Q_3$) or Upper Quartile: Has 75% of the items below it and 25% above it.
The formulas for $Q_1$ and $Q_3$ for ungrouped/discrete series are:
$Q_1 =$ size of $(\frac{N+1}{4})^{th}$ item
$Q_3 =$ size of $3(\frac{N+1}{4})^{th}$ item
Percentiles
Percentiles divide the distribution into one hundred equal parts. There are 99 percentiles ($P_1, P_2, \dots, P_{99}$). The median is the 50th percentile ($P_{50}$).
Mode
Definition and Use
The Mode is the value that occurs most frequently in a series of data. It is the value around which there is the maximum concentration of items. The word 'mode' comes from the French "la Mode," which signifies the most fashionable value. It is denoted by $M_o$.
The mode is the most appropriate measure of central tendency for qualitative data or when a business wants to know the most typical or popular item. For example, a shoe manufacturer would be interested in the shoe size that has the maximum demand (the modal size).
Unlike the mean and median, the mode is not necessarily unique. A distribution can be:
- Unimodal: Has one mode.
- Bimodal: Has two modes.
- Multimodal: Has more than two modes.
- It may have no mode at all if no value is repeated.
Computation of Mode
Ungrouped and Discrete Series
The mode is found by simple inspection. It is the value of the variable that has the highest frequency.
Example 1. For the dataset 1, 2, 3, 4, 4, 5, the mode is 4 because it occurs twice, more than any other value.
Continuous Series
- Identify the modal class, which is the class with the highest frequency.
- Apply the following formula to calculate the mode:
Mode ($M_o$) = $L + \frac{D_1}{D_1 + D_2} \times h$
Where:
- $L$ = lower limit of the modal class.
- $D_1$ = difference between the frequency of the modal class and the frequency of the preceding class (ignoring signs).
- $D_2$ = difference between the frequency of the modal class and the frequency of the succeeding class (ignoring signs).
- $h$ = class interval of the modal class.
For this formula to be applied, the class intervals should be equal and the series should be in an exclusive form.
Relative Position of Mean, Median, and Mode, and Conclusion
Relative Position of Arithmetic Mean, Median, and Mode
In a frequency distribution, the relative positions of the mean, median, and mode depend on the shape (symmetry) of the distribution.
- In a symmetrical distribution, the Mean, Median, and Mode are all equal (Mean = Median = Mode).
- In an asymmetrical (skewed) distribution, these values are not equal.
- For a positively skewed distribution (skewed to the right), the relationship is: Mean > Median > Mode.
- For a negatively skewed distribution (skewed to the left), the relationship is: Mean < Median < Mode.
In all cases of skewness, the median is always located between the arithmetic mean and the mode.
Conclusion
Measures of central tendency summarize a dataset with a single, most representative value. The choice of which average to use depends on the purpose of the analysis and the nature of the data distribution.
| Average | Key Characteristics and Use Cases |
|---|---|
| Arithmetic Mean |
|
| Median |
|
| Mode |
|
Selecting the appropriate average is crucial for a meaningful and accurate summary of the data.
NCERT Questions Solution
Question 1. Which average would be suitable in the following cases?
(i) Average size of readymade garments.
(ii) Average intelligence of students in a class.
(iii) Average production in a factory per shift.
(iv) Average wage in an industrial concern.
(v) When the sum of absolute deviations from average is least.
(vi) When quantities of the variable are in ratios.
(vii)In case of open-ended frequency distribution.
Answer:
Question 2. Indicate the most appropriate alternative from the multiple choices provided against each question.
(i) The most suitable average for qualitative measurement is
(a) arithmetic mean
(b) median
(c) mode
(d) geometric mean
(e) none of the above
(ii) Which average is affected most by the presence of extreme items?
(a) median
(b) mode
(c) arithmetic mean
(d) none of the above
(iii) The algebraic sum of deviation of a set of n values from A.M. is
(a) n
(b) 0
(c) 1
(d) none of the above
Answer:
Question 3. Comment whether the following statements are true or false.
(i) The sum of deviation of items from median is zero.
(ii) An average alone is not enough to compare series.
(iii) Arithmetic mean is a positional value.
(iv) Upper quartile is the lowest value of top 25% of items.
(v) Median is unduly affected by extreme observations.
Answer:
Question 4. If the arithmetic mean of the data given below is 28, find (a) the missing frequency, and (b) the median of the series:
| Profit per retail shop (in Rs) | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 | 50-60 |
|---|---|---|---|---|---|---|
| Number of retail shops | 12 | 18 | 27 | - | 17 | 6 |
Answer:
Question 5. The following table gives the daily income of ten workers in a factory. Find the arithmetic mean.
| Workers | A | B | C | D | E | F | G | H | I | J |
|---|---|---|---|---|---|---|---|---|---|---|
| Daily Income (in Rs) | 120 | 150 | 180 | 200 | 250 | 300 | 220 | 350 | 370 | 260 |
Answer:
Question 6. Following information pertains to the daily income of 150 families. Calculate the arithmetic mean.
| Income (in Rs) | Number of families |
|---|---|
| More than 75 | 150 |
| ,, 85 | 140 |
| ,, 95 | 115 |
| ,, 105 | 95 |
| ,, 115 | 70 |
| ,, 125 | 60 |
| ,, 135 | 40 |
| ,, 145 | 25 |
Answer:
Question 7. The size of land holdings of 380 families in a village is given below. Find the median size of land holdings.
| Size of Land Holdings (in acres) | Number of families |
|---|---|
| Less than 100 | 40 |
| 100–200 | 89 |
| 200–300 | 148 |
| 300–400 | 64 |
| 400 and above. | 39 |
Answer:
Question 8. The following series relates to the daily income of workers employed in a firm. Compute (a) highest income of lowest 50% workers (b) minimum income earned by the top 25% workers and (c) maximum income earned by lowest 25% workers.
| Daily Income (in Rs) | 10–14 | 15–19 | 20–24 | 25–29 | 30–34 | 35–39 |
|---|---|---|---|---|---|---|
| Number of workers | 5 | 10 | 15 | 20 | 10 | 5 |
(Hint: compute median, lower quartile and upper quartile.)
Answer:
Question 9. The following table gives production yield in kg. per hectare of wheat of 150 farms in a village. Calculate the mean, median and mode values.
| Production yield (kg. per hectare) | Number of farms |
|---|---|
| 50–53 | 3 |
| 53–56 | 8 |
| 56–59 | 14 |
| 59–62 | 30 |
| 62–65 | 36 |
| 65–68 | 28 |
| 68–71 | 16 |
| 71–74 | 10 |
| 74–77 | 5 |
Answer: